Predicting the Running Times of Parallel Programs by Simulation

نویسندگان

  • Radu Rugina
  • Klaus E. Schauser
چکیده

Predicting the running time of a parallel program is useful for determining the optimal values for the parameters of the implementation and the optimal mapping of data on processors. However, deriving an explicit formula for the running time of a certain parallel program is a difficult task. We present a new method for the analysis of parallel programs: simulating the execution of parallel programs by following their control flow and by determining, for each processor, the sequence of send and receive operations according to the LogGP model. We developed two algorithms to simulate the LogGP communication between processors and we tested them on the blocked parallel version of the Gaussian Elimination algorithm on the Meiko CS-2 parallel machine. Our implementation showed that the LogGP simulation is able to detect the nonlinear behavior of the program running times, to indicate the differences in running times for different data layouts and to find the local optimal value of the block size with acceptable precision.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gpu Accelerated Parallel Branch Prediction for Multi/many-core Processor Simulation

Branch Prediction is a common function in nowadays microprocessors. Branch predictor is duplicated in each core of a multi/many-core processor and makes prediction for multiple concurrent running programs respectively. To evaluate the parallel branch prediction in a multi/many-core processor, existing schemes generally use a parallel simulator running on a CPU that does not have a real massive ...

متن کامل

arallel Algorithm for Compile-Time Scheduling of Parallel Programs on Multiprocessors

In this paper, we propose a parallel randomized algorithm, called Parallel Fast Assignment using Search Technique (PFAST), for scheduling parallel programs represented by directed acyclic graphs (DAGs) during compile-time. The PFAST algorithm has O ( e ) time complexity where e is the number of edges in the DAG. This linear-time algorithm works by first generating an initial solution and then r...

متن کامل

Predicting Parallel Applications’ Performance Across Platforms Using Partial Execution

Performance prediction across platforms is increasingly important in today’s diverse computing environments. As both programs and their developers face unprecedented wide choices in execution platforms, cross-machine execution time prediction with reasonable accuracy equally benefits scheduling decisions of grid jobs as well as scientists in their research and development planning. In this pape...

متن کامل

Prediction of sprint active women performance using vertical and leg stiffness

Introduction: The ability to attain high level of speed is an essential component of success in many sports fields. However, physical qualities that underpin this ability remain unclear. The purpose of this study was to investigate the role of leg stiffness and vertical stiffness in predicting sprint running. Methods: 50 healthy females were selected among physical education students. The requi...

متن کامل

Analysis of Memory and Time Savings Using EC/DSIM

This paper introduces the EC frontend and DSIM simulator. Given a parallel program, they determine its execution time on huge networks of computers. EC extracts task step needs. DSIM predicts completion times rather than simulating each program step. This paper contains analyses of the memory savings and the execution time savings for simulations of one to 2,800 computers running parallel Gauss...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998